Performance Modeling of Spatio-Temporal Algorithms Over GEDS Framework

Authors

  • Jonathan Cazalas
  • Ratan K. Guha
Abstract

The efficient processing of spatio-temporal data streams is an area of intense research. However, all methods rely on an unsuitable processor (Govindaraju, 2004), namely a CPU, to evaluate concurrent, continuous spatio-temporal queries over these data streams. This paper presents a performance model of the execution of spatio-temporal queries over the authors’ GEDS framework (Cazalas & Guha, 2010). GEDS is a scalable, Graphics Processing Unit (GPU)-based framework that employs computation sharing and parallel processing paradigms to deliver scalability in the evaluation of continuous spatio-temporal queries over spatio-temporal data streams. Experimental evaluation shows the scalability and efficacy of GEDS in spatio-temporal data streaming environments and demonstrates that the parallel processing power provided by GEDS outweighs the costs associated with memory transfers. To move beyond the analysis of specific algorithms over the GEDS framework, the authors developed an abstract performance model detailing the relationship between the CPU and the GPU. From this model, they extrapolate a list of attributes common to successful GPU-based applications, thereby providing insight into which algorithms and applications are best suited to the GPU, along with an estimated theoretical speedup for such applications.

DOI: 10.4018/jghpc.2012070104

International Journal of Grid and High Performance Computing, 4(3), 63-84, July-September 2012

Copyright © 2012, IGI Global.

[...] to as Location Based Services (LBS), truly become ubiquitous.
Traditionally, mobile object databases augment the standard database model of persistent data storage and complex querying by adding new models and index structures geared to store, track, and process the locations of moving objects efficiently (Guttman, 1984; Beckmann et al., 1990; Gedik & Liu, 2006). R-trees (Guttman, 1984) have been the most popular mechanism for spatial indexing, and many variants have been proposed, including the R*-tree (Beckmann et al., 1990), the X-tree (Berchtold et al., 1996), and the Lazy Update R-tree (Kwon et al., 2002), among others. Additionally, there is a large body of research focused on reducing the computational burden of continuously monitoring and evaluating real-time queries over these mobile objects. Such works include MQM (Cai, Hua, Cao, & Xu, 2006), MobiEyes (Gedik & Liu, 2006), Domino (Wolfson et al., 1999), and CAT (Trajcevski et al., 2004), to name a few.

While these models and structures did initially extend the research in this area, the past few years have witnessed the emergence of a new class of data-intensive applications that often require the continuous processing of potentially unbounded sequences of transient data, called data streams. Examples include financial tickers, internet traffic, sensor data, and transaction logs. The high arrival rates of these spatio-temporal data streams, coupled with their massive data sizes, make it infeasible for traditional DBMS techniques to store, query, or index these streams, and therefore dictate the need for better solutions. In simplest terms, a data stream can be defined as “a sequence of characters or bits that is too large to be viewed in its entirety” (Hartzman & Watters, 1990). Several works have convincingly argued that the two research fields of spatio-temporal data streams and the management of moving objects can naturally come together (Chandrasekaran & Franklin, 2003; Ghanem et al., 2007; Mokbel et al., 2004).
For example, the output of a GPS receiver monitoring the position of a mobile object can be viewed as a data stream of location updates. This stream, along with those from the potentially many other mobile objects, is received at a centralized server, which processes the streams upon arrival, effectively updating the answers to the currently active queries in real time. From this model, it becomes clear that additional applications could benefit from modeling location updates as streaming data, including, but not limited to, network traffic, time series data, telephone records, weather data, and web click streams. Unfortunately, most of the recent research in data stream management systems (Abadi et al., 2003; Babu & Widom, 2001; Chandrasekaran & Franklin, 2003) is insufficient, as it overlooks the spatial and temporal qualities of both the data streams and the continuous queries over these streams. It is these two qualities, specifically, that distinguish continuous query processing in spatio-temporal data streams from that in traditional data streams. Because both queries and data can continuously change their locations, spatio-temporal data streams are viewed as a series of location updates rather than the append-only model of classical data streams (Mokbel & Aref, 2008). Additionally, the temporal quality stipulates that a mobile object may be added to or removed from the result set of a spatio-temporal query, an example being GPS-equipped vehicles moving in and out of a query region. Because these queries are continuous in nature, any delay would result in an obsolete response. Therefore, it is vital to procure scalable and efficient algorithms for the processing of continuous spatio-temporal queries over data streams. To this end, SINA was proposed, exploiting shared execution and incremental evaluation (Mokbel et al., 2004).
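The idea of incremental evaluation described above, where a moving object is added to or removed from a query's result set as location updates arrive, can be illustrated with a small sketch. This is not the SINA, SOLE, or GEDS implementation: the names `RangeQuery`, `IncrementalEvaluator`, and `on_location_update`, the rectangular query regions, and the `"+"`/`"-"` update tuples are all assumptions made for illustration. The sketch keeps the previous answer of each query and reports only the changes caused by one location update.

```python
class RangeQuery:
    """Hypothetical continuous range query over a rectangular region."""

    def __init__(self, qid, x_min, y_min, x_max, y_max):
        self.qid = qid
        self.x_min, self.y_min = x_min, y_min
        self.x_max, self.y_max = x_max, y_max

    def contains(self, x, y):
        # True when the point lies inside the (closed) query rectangle.
        return self.x_min <= x <= self.x_max and self.y_min <= y <= self.y_max


class IncrementalEvaluator:
    """Keeps the last answer of every query and emits only the differences."""

    def __init__(self, queries):
        self.queries = list(queries)
        # Previous result set of each query, keyed by query id.
        self.answers = {q.qid: set() for q in self.queries}

    def on_location_update(self, oid, x, y):
        """Process one location update and return the resulting changes.

        Each change is a tuple (sign, query id, object id), where "+" means
        the object entered the result set and "-" means it left.
        """
        changes = []
        for q in self.queries:
            was_in = oid in self.answers[q.qid]
            is_in = q.contains(x, y)
            if is_in and not was_in:
                self.answers[q.qid].add(oid)
                changes.append(("+", q.qid, oid))
            elif was_in and not is_in:
                self.answers[q.qid].discard(oid)
                changes.append(("-", q.qid, oid))
            # If membership is unchanged, nothing is reported for this query.
        return changes


if __name__ == "__main__":
    evaluator = IncrementalEvaluator([RangeQuery("q1", 0, 0, 10, 10)])
    print(evaluator.on_location_update("car1", 5, 5))   # vehicle enters q1
    print(evaluator.on_location_update("car1", 6, 6))   # still inside
    print(evaluator.on_location_update("car1", 20, 5))  # vehicle leaves q1
```

In this sketch every update is tested against every query; a shared structure such as a grid over the space would let an update be checked only against the queries overlapping its cell, which is the kind of computation sharing the text refers to.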
The main drawback of SINA was its reliance on physical disk storage to perform its operations. SOLE was then proposed as a scalable, in-memory algorithm that uses an incremental evaluation paradigm and a grid structure to evaluate concurrent, continuous spatio-temporal queries over data streams (Mokbel & Aref, 2008). SOLE avoids slow physical data storage, but is in turn limited by the available memory. As a result, a load-shedding [...]

Related resources

Modeling of spatio-temporal of albedo over Iran

The aim of this study is to model spatio-temporal variations of albedo. The study considered the simultaneous effects of several components, such as the wetness of the soil surface layer, cloudiness, topography, and vegetation density (NDVI), using the MERRA-2 model at a resolution of 50 × 50 km during 2000-2010 in Iran. The results of spatial analysis of albedo values in Iran showed that the high...


Spatio-temporal analysis of diurnal air temperature parameterization in Weather Stations over Iran

Diurnal air temperature modeling is a useful experimental and mathematical approach that can be applied in many fields related to the geosciences. For the first time, the modeling and spatio-temporal analysis of the air Diurnal Temperature Cycle (DTC) was conducted using data obtained from 105 synoptic stations in Iran during the years 2013-2014; the key variable for controlling the cosine term i...


Context-aware Modeling for Spatio-temporal Data Transmitted from a Wireless Body Sensor Network

Context-aware systems must be interoperable and work across different platforms at any time and in any place. Context data collected from wireless body area networks (WBAN) may be heterogeneous and imperfect, which makes their design and implementation difficult. In this research, we introduce a model which takes the dynamic nature of a context-aware system into consideration. This model is con...


The LICORS Cabinet: Nonparametric Algorithms for Spatio-temporal Prediction

Spatio-temporal data is intrinsically high dimensional, so unsupervised modeling is only feasible if we can exploit structure in the process. When the dynamics are local in both space and time, this structure can be exploited by splitting the global field into many lower-dimensional “light cones”. We review light cone decompositions for predictive state reconstruction, introducing three simple ...


A New Wavelet Based Spatio-temporal Method for Magnification of Subtle Motions in Video

Video magnification is a computational procedure that reveals subtle variations across video frames that are invisible to the naked eye. A new spatio-temporal method that makes use of connectivity-based mapping of the wavelet sub-bands is introduced here for exaggerating small motions across video frames. In this method, the wavelet-transformed frames are first mapped to connectivity space a...


Journal title:
  • IJGHPC

Volume 4

Publication date: 2012